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ABSTRACT 

The Drosophila Suppressor of Hairy-wing [Su(Hw)] 
protein is a globally expressed, multi-zinc finger 
(ZnF) DNA-binding protein. Su(Hw) forms a classic 
insulator when bound to the gypsy retrotrans- 
poson and is essential for female germline develop- 
ment. These functions are genetically separable, as 
exemplified by Su(Hw) f that carries a defective 
ZnF10, causing a loss of insulator but not 
germline function. Here, we completed the first 
genome-wide analysis of Su(Hw)-binding sites 
(SBSs) in the ovary, showing that tissue-specific 
binding is not responsible for the restricted devel- 
opmental requirements for Su(Hw). Mapping of 
ovary Su(Hw) f SBSs revealed that female fertility 
requires binding to only one third of the wild-type 
sites. We demonstrate that Su(Hw) f retention cor- 
relates with binding site affinity and partnership 
with Modifier of (mdg4) 67.2 protein. Finally, we 
identify clusters of co-regulated ovary genes 
flanked by Su(Hw) f bound sites and show that 
loss of Su(Hw) has limited effects on transcription 
of these genes. These data imply that the fertility 
function of Su(Hw) may not depend upon the de- 
marcation of transcriptional domains. Our studies 
establish a framework for understanding the 
germline Su(Hw) function and provide insights into 
how chromatin occupancy is achieved by multi-ZnF 
proteins, the most common transcription factor 
class in metazoans. 



INTRODUCTION 

Transcription factors execute complex gene expression 
programs important for the diversity of cellular pheno- 
types. These processes depend upon the proper action of 
enhancers and silencers that modulate transcriptional 
output of a promoter. Enhancers and silencers act over 
long distances and display limited promoter specificity, 
requiring additional mechanisms to ensure promoter 
selectivity. One class of elements that constrain enhan- 
cer and silencer action is insulators, regulators that 
are defined by two properties (1-3). First, insulators 
block enhancer-promoter communication in a position- 
dependent manner, such that an insulator prevents 
enhancer-activated transcription only when located 
between the enhancer and promoter. Second, insulators 
act as barriers that disrupt the spread of repressive chro- 
matin. Insulator function depends upon the assembly of 
protein complexes, initiated by a DNA-binding protein. 
Recognition motifs for insulator binding proteins repre- 
sent one of the most conserved non-coding DNA elements 
in metazoan genomes (4,5), indicating that insulator 
proteins have a critical role in transcriptional regulation. 

The Drosophila Suppressor of Hairy-wing [Su(Hw)] 
protein was one of the first insulator proteins identified 
(6,7). This 12 zinc finger (ZnF) protein plays a key role 
in establishing the insulator within the S'-untranslated 
region (UTR) of the gypsy retrovirus (8-11). The gypsy 
insulator is comprised of 12 tightly clustered Su(Hw)- 
binding sites (SBSs). The architecture of SBSs within the 
gypsy insulator plays an important role in its function, 
as deletion of binding sites or insertions into the insula- 
tor compromise enhancer blocking (12-14). Enhancer 
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blocking by the gypsy insulator requires Su(Hw)-depend- 
ent recruitment of two Broad-complex, Tramtrack 
and Bric-a-brac (BTB) domain cofactors, Modifier of 
(mdg4) 67.2 (Mod67.2) and Centrosomal Protein of 
190kDa (CP 190) (15,16), whereas formation of a barrier 
against the spread of repressive chromatin requires 
recruitment of the Enhancer of yellow 2 (ENY2) protein 
(17). The gypsy insulator is portable, with evidence that 
placing this insulator into transgenes confers protection 
from chromosomal position effects throughout the 
Drosophila genome (18,19), as well as in other organisms 
(20). Based on these observations, the Su(Hw)-binding 
region of gypsy has become a paradigmatic insulator 
and Su(Hw) a classical insulator protein. 

A large number of su(Hw ) mutants have been identified 
based on reversal of gypsy-induced mutations (7,21,22). In 
addition to suppressing gypsy-induced phenotypes, most 
su(Hw) mutants show defects in female germline develop- 
ment, wherein oocytes are lost due to mid-oogenesis 
apoptosis (22). Among the su(Hw) mutants identified, 
two alleles carry missense mutations that disrupt a 
single ZnF within the multi-ZnF domain. The su(Hw) E8 
mutation encodes a full-length Su(Hw) protein with a 
defective ZnF7, resulting in suppression of gypsy insulator 
activity and female sterility. These defects correlate with a 
loss of in vitro and in vivo DNA binding (23,24), indicating 
that both Su(Hw) functions depend on DNA recognition. 
The su(Hw/ mutation encodes a full-length Su(Hw) 
protein with a defective ZnF 10, resulting in suppression 
of gypsy insulator activity but retention of female fertility. 
This observation indicates that loss of ZnF 10 separates 
Su(Hw) functions. Su(Hw) f binds DNA in vitro, but 
displays reduced in vivo chromosome occupancy (24,25). 
The retention of fertility in su(Hwf mutants suggests that 
Su(Hw) f remains bound at SBSs essential for female 
germline development. 

The function of Su(Hw) in female germline develop- 
ment is not well understood. Emerging evidence suggests 
that the Su(Hw) germline function and insulator roles are 
distinct (25). In the present study, we investigated 
two questions to gain an understanding of the role of 
Su(Hw) in oogenesis. First, we asked whether Su(Hw) 
association with tissue-specific binding sites might account 
for its restricted developmental requirement. Second, we 
asked whether Su(Hw) f demarcates boundaries of 
co-expressed gene clusters in the ovary. To address these 
questions, we defined genome-wide binding of wild-type 
and Su(Hw) f , using chromatin immunoprecipitation 
coupled with deep sequencing (ChlP-Seq) and extensive 
validation of defined SBSs by quantitative real-time 
PCR (ChlP-qPCR). Our studies demonstrate that 
Su(Hw) binding is largely constitutive during develop- 
ment, suggesting that its ovary function does not involve 
tissue-specific binding. We show that Su(Hw) f is retained 
at one-third of wild-type sites genome wide, suggesting 
that the majority of SBSs are dispensable for oogenesis. 
These data demonstrate that loss of a single ZnF within a 
multi-ZnF domain has profound effects on chromosome 
occupancy genome wide. We demonstrate that multiple 
factors influence Su(Hw) f retention, including Su(Hw) 
DNA-binding affinity and partnership with Mod67.2. 



Using the Su(Hw) SBS dataset, we identified clusters of 
genes that are co-expressed in the ovary and are delimited 
by Su(Hw) f retained sites. Expression studies demon- 
strated that loss of Su(Hw) had limited effects on tran- 
scription of these genes. Based on these data, we suggest 
that Su(Hw) may not play a global architectural role in 
establishing genome regulation important for oogenesis. 

MATERIALS AND METHODS 

Chromatin immunoprecipitation coupled with 
deep sequencing 

Genome-wide association of wild-type and Su(Hw) f was 
determined using ChlP-Seq. These experiments used chro- 
matin isolated from ovaries dissected from females 
younger than 6-h old to provide an optimized balance 
between chromatin contributed by somatic and germ 
cells (25). Approximately 450 ovary pairs were used in 
each experiment, dissected from females raised at 25° C, 
70% humidity on standard cornmeal/agar medium. 
Dissected material was stored in 1 x PBS at —80 C° until 
needed. Chromatin was prepared as described previously 
(25), with the following modifications. Chromatin was 
cross-linked by incubating ovaries in PBS with 1.8% for- 
maldehyde. After lOmin at room temperature, samples 
were sonicated with the Fisher sonic dismembrator 100 
flat microtip, using eight pulses of 30s at 6W, 90s 
between pulses, producing an average size of 150-200 bp. 
The resulting chromatin was processed in three ways. 
First, 10% of the chromatin was heat treated to reverse 
cross-links and phenol-chloroform extracted, to generate 
the input DNA fraction. Second, the remaining chromatin 
was divided into two fractions. One fraction was 
incubated with guinea pig oc-Su(Hw) antibody (8 |il) (25) 
and the other with guinea pig pre-immune serum (8 jil). 
Enriched DNA was obtained by incubation with protein 
A beads (Sigma) and phenol-chloroform extraction. 
Single-end libraries for Illumina high-throughput seque- 
ncing were prepared from ~100ng of DNA from each 
fraction (Cincinnati Children's Hospital Medical Center 
Genetic Variation and Gene Discovery Core Facility, 
Cincinnati, OH, USA). 

Peak detection, validation and motif analyses 

Illumina Genome Analyzer IIx fastq files were processed 
using bowtie v. 0.12.5 software (26) to map sequence reads 
containing fewer than two mismatches to the fly genome 
(BDGP Release 5). Bowtie output files were analyzed with 
the Partek Genomics Suite v. 6.5 with the following par- 
ameters: the window size was 50 bp and peaks within 
100 bp of each other were merged. False discovery rate 
(FDR) for peak detection was set to 1%. Fold enrichment 
values were calculated based on the reference pre-immune 
ChIP sample. To define the Su(Hw)-binding motif within 
ovary chromatin, the top 500-fold enrichment peaks 
identified with Partek were evaluated using MEME 
v.4.4.0 (27). The sequence logo was generated using 
WebLogo v.2.8.2 (28,29). Information contents of the 
consensus sequences were calculated as described by 
(30). All ChlP-seq data were submitted to the NIH 
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GEO/Sequence Read Archive database, accession number 
GSE33052. 

Validation of individual Su(Hw)-binding regions was 
completed using qPCR. For these experiments, chromatin 
was isolated from many sources, including ovaries 
(75 pairs per ChIP), wing discs (200 per ChIP) and third 
instar larval brains with attached eye and antennal discs 
(referred to as larval brain, 150 per ChIP). Quantification 
was completed using SYBR green (BioRad) using primers 
designed to amplify 100-200 bp fragments centered on 
a Su(Hw)-binding motif (available upon request). 
The following formula (% input = 2 Ct(input) " Ct(lp) x 1 / 
DF x 100, where DF is the dilution factor between IP 
and input samples) was used to calculate ChlP/input 
cycle threshold change ratios. All analyses were performed 
in at least two biological replicates. 

To define the degree of conservation among SBSs, 
Patser v. 3b (31) was used to scan each peak region to 
identify highest scoring SBS. The position-specific weight 
matrix (PSWM) used by Patser was obtained from 
MEME using top 500 wild-type peaks. After determining 
the position of the SBS within the binding peak, we 
generated a phastCons score profile encompassing the 
highest scoring SBSs (20 bp of the binding site and 10 bp 
on each side). PhastCons score profiles were aligned and 
the median score at each position was used for further 
analysis. 

Analyses of overlap between SBSs with chromatin proteins 

Analyses were undertaken to determine whether SBSs 
overlap with binding of Mod67.2 and CP 190. Binding 
peak data for these factors were downloaded from the 
modENCODE project website (http://modencode.org/). 
Two binding peaks were considered overlapping if the 
peak center distance was shorter than half of the longer 
peak. This criterion ensures that the shorter peaks fall 
within the longer peak. Statistical significance of binding 
site overlap was determined using 1000 random sets of 
binding peaks; each random set had the same length dis- 
tribution as the real data. The P-value of site overlap was 
empirically determined as the fraction of random sets that 
have equal or larger number of overlapping sites. 

Analysis of Su(Hw) binding 

The in vitro DNA-binding properties of wild-type 
[Su(Hw) + ] and Su(Hw) f were determined using full-length 
His-tagged proteins purified from Escherichia coli DE3 
cells, using a protocol described previously (11). SBSs 
were isolated by PCR amplification of Canton 
S genomic DNA and cloned into StrataClone vectors 
(Agilent Technologies). Electrophoretic mobility shift 
assays (EMSAs) were used to define interactions 
between purified Su(Hw) + or Su(Hw) f with DNA, using 
DNA-binding conditions described previously (11). 
EMSA assays were used to determine the apparent asso- 
ciation constant (M _1 ) for Su(Hw) + and Su(Hw) f for 
selected 32 P-labeled SBSs (Supplementary Figure S2), 
using the non-linear least-squares analysis of a Langmuir 
binding equation for non-cooperative binding using 
Kaleidagraph (Synergy Software) (32). Interactions of 



Su(Hw) + or Su(Hw) f with bacterially purified full-length 
His-tagged Mod67.2 and CP 190 were analyzed using 
EMSAs. In these experiments, equal molar amounts of 
Su(Hw) + or Su(Hw) f were incubated with CP 190 and 
Mod67.2 (Supplementary Figure SI) in the presence of a 
32 P-labeled SBS. Total protein was normalized by 
addition of bovine serum albumin. 

Polytene chromosome staining 

Salivary glands were dissected from wandering third instar 
larvae into Cohen Buffer (10 mM MgCl 2 , 25 mM 
Na 2 GlyceroP0 4 , 3mM CaCl 2 , 10 mM KH 2 P0 4 , 0.5% 
NP40, 30 mM KC1, 160mM Sucrose), fixed for 3min in 
formaldehyde (100 mM NaCl, 2mM KC1, 2% Triton 
X-100, 2% formaldehyde, 10 mM NaP0 4 pH 7.0) and 
2min in squashing solution (45% acetic acid, 2% formal- 
dehyde). Glands were squashed and flash frozen in liquid 
nitrogen. Following washes, slides were stained for 1-2 h 
with guinea pig anti-Su(Hw) (1:250), sheep anti-CP 190 
(1:100) and chicken anti-Mod67.2 (1:500) antibodies in 
PBS containing 10% normal goat serum or lOmg/ml 
non-fat dry milk in the case of CP 190, 5mg/ml gamma 
globulins. Slides were incubated 1 h with goat anti-guinea 
pig Alexa Fluor 488 (A11073) at a 1:1000 dilution. 
Following DAPI (4 / ,6-diamidino-2-phenylindole) staining, 
slides were mounted in Vectashield H-1000 (Vector 
Laboratories) and imaged using a Leica DMLB or Zeiss 
710 confocal microscope. Images were processed using 
ImageJ and Adobe Photoshop. 

Quantitative PCR analyses of ovary gene expression 

Ovaries were dissected from virgin females 4-6-h 
post-eclosion. Total RNA was isolated using TRIzol 
(Invitrogen), DNase treated with DNA free kit 
(Ambion) and reverse-transcribed using High Capacity 
cDNA kit with random hexamer primers (Applied 
Biosystems). Quantitative PCR analyses were performed 
using iQ SYBR green supermix (BioRad). Three inde- 
pendent biological replicates were analyzed. Expression 
level of each gene was determined relative to RpL32 as 
an internal control (AC t ). 

RESULTS 

Genome-wide mapping of SBSs in the ovary 

Previous genome-wide mapping of Su(Hw) in two embry- 
onic cell lines suggested cell type-specific binding (33). 
These observations indicated that the restricted develop- 
mental requirement for Su(Hw) may result from 
tissue-specific binding in the ovary that contributes 
unique regulatory functions. To test this postulate, we 
used ChlP-Seq to define SBSs in the ovary. Chromatin 
was isolated from ovaries dissected from females <6h 
old, because these ovaries contain a nearly equal contri- 
bution of chromatin from germ cells and somatic cells. 
The oc-Su(Hw) ChIP sample generated 21 million 
uniquely mapped tags, with ~13 and ~14 million 
obtained from ChIP with pre-immune serum and input 
chromatin, respectively (Supplementary Table SI). 



5418 Nucleic Acids Research, 2012, Vol. 40, No. 12 



chr2L:| 
800 



47000001 



47500001 



48000001 



48500001 



Ovary 
a-Su(Hw) 
aligned 
reads 

o . .^ 

preimmune IP o_^ 
input o 
Ovary 
Embryo-PG ab 
Embryo-VC ab 
BG3 
S2 

S2-VC ab 
Kc157 
Mbn2 
RefSeq genes I 



4900000I 



4950000I 5000000I 



La , 



-■I Li. .... y .. 1 to^hJ^jjMi J 1 ■ 



I I 
I I 
I I 
I I 



I I 

I 



ll I 
1 1 



I II 



H Hf 



Him m hmmwmh — \-mm in— »ih — himii 



1 

INI 



I I 



B 



input 



30- 
25- 
20- 
15 ■ 
10 ■ 

5- 

1 
0 



ChIP: a-Su(Hw) 
ovary chromatin 



..Hill 



fold 
enrichment 



80 



70 



60 



50 



40 



30 



20 



10 



4.9E-05 1.5E-03 



negative 



first second third 



fourth 



quartile 



i 



2932 1800 14 

<f y </« 



Figure 1. ChlP-Seq analysis of SBSs in the ovary. (A) Shown is a UCSC Genome Browser view of a representative 360 kb region of chromosome 2L. 
Several tracks are shown, including Su(Hw) WT ovary ChlP-Seq reads, the pre-immune serum IP control reads, and the input chromatin control 
reads. The ovary peaks (1% FDR) defined from our data are compared to SBSs identified in (33,34). These latter datasets include peaks identified in 
embryos using the PG or VC antibody and four cell lines (BG3, S2, Kcl57 and Mbn2). Examples of ovary-gained and ovary-lost sites are highlighted 
(red and blue bars). (B) Shown are data from validation studies of SBSs identified in ChlP-seq. ChlP-qPCR was completed on two biologically 
independent chromatin isolations, distinct from that used for ChlP-seq. SBSs are arranged into four quartiles based on the level of Su(Hw) 
occupancy predicted from ChlP-Seq data. Six randomly chosen SBSs from each quartile were tested. Error bars indicate standard deviation. 
Negative controls represent sites that lack a Su(Hw)-binding motif. These sites were not identified in the ChlP-Seq dataset. (C) Box plot of fold 
enrichment values of all SBSs identified in the ovary (white), SBSs identified in all genome-wide studies (constitutive, gray), and SBSs identified only 
in the ovary dataset (ovary only, black). Boxes represent the 25-75 percentile interval, with the median enrichment indicated by the line. Whiskers 
represent the non-outlier range. P-values of Student's /-test are indicated. 



Using a 1% FDR, we identified 2932 SBSs with a median 
peak size of 293 bp (Figure 1 A, Supplementary Table S2). 
To validate the ovary SBS dataset, we completed 
ChlP-qPCR analysis on newly isolated ovary chromatin. 



In these studies, we analyzed four sets of six SBSs repre- 
senting each enrichment quartile and six genomic regions 
that were negative in our ChlP-Seq data and lacked a 
Su(Hw)-binding motif. These analyses demonstrated that 
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all 24 ChlP-seq SBSs displayed 1% or greater enrichment 
relative to input (Figure IB), with the level of SBS enrich- 
ment in the ChlP-qPCR analysis generally related to that 
found in the ChlP-seq data. Comparison of ChlP-qPCR 
values for true SBSs relative to the negative control 
regions indicates that a 1% input discriminates between 
real and background Su(Hw) binding (Figure IB). For 
this reason, we used this empirically defined value to 
evaluate significant Su(Hw) occupancy in subsequent 
experiments. Together, these experiments demonstrate 
that we have generated a high-quality dataset of 
wild- type Su(Hw) ovary binding sites. 

To determine whether Su(Hw) was associated with 
ovary-specific binding sites, we defined the extent of 
overlap between our ovary dataset and those generated 
from ChlP-Chip studies of chromatin isolated from em- 
bryonic and larval cells (Figure 1A, Supplementary Table 
S2) (33-35). These analyses showed that -99% of the 
ovary SBSs were found in at least one other dataset, 
with nearly two-thirds found in all sets (61.4%, —1800 
SBSs; referred to as constitutive SBSs). As a result, only 
14 (0.5%) of SBSs were found exclusively in the ovary 
dataset (referred to as ovary-gained sites), while 325 
(—11%) were not found in the ovary dataset (referred to 
as ovary-lost sites). Together, these data predict that few, 
if any, SBSs are ovary specific. 

We reasoned that differences among SBS datasets might 
represent small, but functionally important tissue-specific 
changes in chromosome association. Alternatively, 
distinct SBS identification may be related to technical dif- 
ferences in experimentation, such as antibody source or 
analysis software. To distinguish between these 
possibilities, we compared the fold enrichment values for 
the SBS classes, which revealed that the median fold en- 
richment for the constitutively bound sites was signifi- 
cantly higher than that of the ovary-gained sites (Figure 
1C). These observations imply that Su(Hw) occupancy at 
ovary-gained sites is at the threshold for detection, ac- 
counting for the absence of these sites in other datasets. 
To determine whether the ovary-gained sites were repro- 
ducibly identified, we randomly selected ten sites for direct 
analysis by ChlP-qPCR. As controls, we analyzed 20 con- 
stitutive SBSs and 20 ovary-lost sites (Figure 2A). We 
found enrichment values for all constitutive sites at or 
above the 1% input threshold. Whereas two constitutive 
sites showed occupancy at 1-3% input, the rest were 
above 5% input. In contrast, the majority of ovary-gained 
sites (7/10) displayed enrichment near the 1% threshold, 
with only one site above 5%. These data suggest that 
ovary-gained sites are valid, but low occupancy SBSs. 
Analysis of ovary-lost sites showed that the majority 
(17/20) were below the 1% threshold. These findings 
confirm that our peak detection was robust, possibly 
because ChlP-Seq has a greater dynamic range than 
ChlP-Chip, which permitted better discrimination of 
ChIP peaks. 

Our analyses identified a small number of ovary-gained 
sites. To determine whether these SBSs are tissue specific, 
we tested Su(Hw) occupancy at these sites in two other 
tissues. To this end, we conducted ChlP-qPCR using chro- 
matin isolated from third instar larval wing discs and 



brains, testing Su(Hw) occupancy at the same set of con- 
stitutive, ovary-lost and ovary-gained SBSs (Figure 2B 
and C). Again, all constitutive SBSs were bound above 
the 1% threshold, with the level of occupancy generally 
consistent between tissues. The majority of ovary-lost 
SBSs (15/20 or 17/20) were below the 1% input threshold 
in both tissues, with the remaining SBSs displaying enrich- 
ment near 1% input. These findings suggest that many 
ovary-lost sites are low occupancy sites at the threshold 
for detection or possibly false positives in other 
datasets. MEME analyses showed that half of the 
ovary-lost sites contain a match to the Su(Hw) 
DNA-binding motif (data not shown), supporting that 
many sites in this group are not false positives. 
ChlP-qPCR analysis of the ovary-gained SBSs showed 
that many were bound in wing disc and brain chromatin, 
although signals for most were near the 1 % input thresh- 
old. Based on these data, we conclude that SBSs originally 
identified as ovary-specific are found in other tissues, sug- 
gesting that Su(Hw) binding is largely constitutive 
throughout development. 

Analysis of properties of SBS subclasses 

SBSs fall into three classes based on partnership with the 
gypsy insulator proteins Mod67.2 and CP190 (34). These 
include (i) SBSs that do not bind the gypsy partners 
(SBS-O), (ii) SBSs with CP 190 only (SBS-C) and (iii) 
SBSs with CP190 and Mod67.2 (SBS-CM) (34). While 
bulk properties of SBSs are reported (33), no studies 
have evaluated whether SBSs display class-specific 
properties. Two observations suggest that SBSs identified 
in our ovary ChlP-seq studies could be used as the basis 
for further understanding properties of SBS subclasses. 
First, we have demonstrated that Su(Hw) binding is 
largely constitutive (Figure 2), suggesting that our ovary 
SBSs dataset accurately reflects Su(Hw) binding genome 
wide. Second, our previous studies indicate that Su(Hw) 
partnership with Mod67.2 and CP 190 is largely 
unchanged during development (25). This conclusion is 
based on immunohistochemical analyses of nurse cell 
polytene chromosomes obtained from the ovarian tumor 
mutant, which showed that co-localization of Su(Hw), 
Mod67.2 and CP 190 parallels that observed on salivary 
gland polytene chromosomes (25). Further, direct qPCR 
analyses demonstrated that -90% (10/11) of SBSs that 
bound CP 190 in embryos also bound CP 190 in the 
ovary (25). Taken together, these data imply that the 
Su(Hw) partnership with Mod67.2 and CP 190 is main- 
tained in the ovary. 

Several properties of SBS subclasses were tested. First, 
we determined the distribution of SBSs among the three 
classes. We found that SBS-O sites represent the most 
prevalent class, corresponding to 54% of SBSs, while 
SBS-CM sites were 29% and SBS-C sites were 17% 
(Figure 3A). These observations imply that most SBSs 
are not associated with the gypsy insulator proteins. 
Second, we used the DNA motif search program 
MEME to define the sequence of the Su(Hw)-binding 
motif associated with each class. These analyses revealed 
that SBS subclasses display only minor differences in 
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information content (Figure 3B), indicating that changes 
in the binding motif are not responsible for differences in 
partner association. Third, we defined the median 
phastCons scores for 40 nucleotide regions centered on 



the Su(Hw)-binding motif to evaluate sequence conserva- 
tion among SBS classes. These analyses showed that 
SBS-O and SBS-C sites are more conserved than 
SBS-CM sites, with the most striking differences 
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Figure 3. Distinct classes of SBSs are found in the genome. (A) Shown is a bar plot of the proportion of SBS-O, SBS-C and SBS-CM sites in the 
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corresponding to nucleotides upstream of the Su(Hw)- 
binding motif (Figure 3C). These data imply that add- 
itional constraints are placed upon Su(Hw) occupancy in 
the absence of Mod67.2. MEME analyses of SBS-O and 
SBS-CM SBSs failed to identify a second conserved 
sequence within these upstream regions, indicating the 



absence of a common binding site for another protein. 
Fourth, we defined the distribution of ovary SBS classes 
relative to gene features. Interestingly, these analyses un- 
covered class-specific differences (Figure 3D). We found 
that SBS-CM sites are significantly enriched within 5'-, 
S'-UTRs and exons (P = 2.0e-12). This finding that the 
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gypsy insulator-like subclass is enriched at promoters is 
consistent with predictions that insulators may have 
evolved from promoters (2,36). Fifth, we compared fold 
enrichment values of SBSs in each subclass to determine 
whether Su(Hw) occupancy differed among SBS classes. 
We found that each SBS subclass displayed a similar 
median fold enrichment, indicating that Su(Hw) does 
not display preferential occupancy at one SBS class 
(Figure 3E). 

Effects of the gypsy insulator proteins on 
Su(Hw) occupancy 

Nearly half of SBSs associate with one or both of the 
gypsy insulator proteins Mod6.72 and CP190. We 
investigated whether these proteins were involved in 
Su(Hw) chromosome association. To this end, we 
examined Su(Hw) binding on salivary gland polytene 
chromosomes isolated from mod(mdg4) and Cpl90 
mutant larvae. This strategy was selected because 
salivary gland chromosomes have been widely used to 
provide mechanistic insights into genome-wide protein 
recruitment (37-41). Immunohistochemical staining of 
Su(Hw) on mod(mdg4) mutant chromosomes demon- 
strated that Su(Hw) localization was reduced (Figure 4A), 
a finding consistent with previous reports (42,43). These 
observations suggest that Mod67.2 may facilitate Su(Hw) 
chromosome association or retention. To investigate the 
impact of Mod67.2 on Su(Hw) retention, we randomly 
selected 15 SBS-CM and 15 SBS-O sites. We predicted 
that if Mod67.2 improved Su(Hw) association, then we 
would observe decreased Su(Hw) occupancy specifically 
at SBS-CM sites, whereas changes at SBS-O sites would 
reflect general chromatin changes in mod(mdg4) mutants. 
Su(Hw) occupancy was defined using qPCR analyses, 
revealing that loss of Mod67.2 decreased Su(Hw) occu- 
pancy at 25% (4/15) of SBS-CM sites, while retention at 
-6% of SBS-O sites was affected (Figure 4B). These 
data support a role for Mod67.2 in facilitating Su(Hw) 
chromosome association at SBS-CM sites. In contrast, 
immunohistochemical analysis of Su(Hw) binding on 
Cpl90 mutant polytene chromosomes revealed a 
minimal loss of Su(Hw) (Figure 4A). Direct tests of 
Su(Hw) occupancy -6% of SBS-CM and 30% of SBS-O 
were increased (Figure 4). These data suggest that 
enhanced Su(Hw) association in Cpl90 mutants may be 
due to indirect effects on chromatin structure. Previous 
studies demonstrated that CP 190 largely localizes to pro- 
moters (44), suggesting that these indirect effects may be 
due to changes in gene expression that affect chromatin 
structure. Taken together, our data indicate a role for 
Mod67.2, but not CP190, in facilitating Su(Hw) binding 
to chromatin. 

Mutation of ZnFlO causes widespread loss of 
Su(Hw) binding 

Su(Hw) f carries a defective ZnFlO. To understand 
whether loss of ZnFlO alters Su(Hw) properties 
by changing its association with the gypsy insulator 
proteins, we studied the ability of Su(Hw) to recruit 
Mod67.2 and CP 190 to polytene chromosomes isolated 



from salivary glands. Previous studies documented exten- 
sive co-localization of Su(Hw) and the gypsy insulator 
proteins on salivary gland chromosomes (16,42,43,45,46), 
with near complete co-localization with Mod67.2 and an 
extensive but lower co-localization with CP 190. Polytene 
chromosomes isolated from su(Hw) + and su(Hw) flv 
larvae were stained with antibodies against Su(Hw) and 
co-stained with antibodies against either Mod67.2 or 
CP 190. Based on the quantification of banding patterns 
obtained from split chromosome analyses, we find that 
Su(Hw) f co-localized with Mod67.2 at nearly all genomic 
locations (Figure 5). These data indicate that loss of 
ZnFlO does not strongly affect Mod67.2 in vivo associ- 
ation, although subtle differences may not be detected. 
The nearly unchanged Su(Hw) f recruitment of Mod67.2 
is consistent with observations that a carboxyl-terminal 
region of Su(Hw) outside of the zinc finger domain is 
required for Mod67.2 association (42,47). Similarly, we 
find that Su(Hw) f co-localization with CP 190 is indistin- 
guishable from wild-type Su(Hw), implying that loss of 
ZnFlO does not affect CP 190 in vivo association, 
although subtle differences may not be detected. To 
extend our analyses of Su(Hw) f partnership with 
Mod67.2 and CP190, we used an EMSA assay with 
purified recombinant proteins to investigate effects of 
loss of ZnFlO. Incubation of Su(Hw) + with Mod67.2 or 
CP 190 generated a supershift that becomes more 
pronounced in the presence of both proteins (Suppl- 
ementary Figure SI). The observed supershift parallels 
that previously reported in an EMSA assay documenting 
associations between wild- type Su(Hw) and CP 190 (16). 
Incubation of Su(Hw) f with recombinant gypsy insulator 
proteins produced a similar supershift. Taken together, 
these data support the conclusion that loss of ZnFlO 
does not alter Su(Hw) f association with the gypsy insula- 
tor proteins. 

Previous studies demonstrated that ZnFlO mutant loses 
in vivo occupancy of Su(Hw) at the gypsy insulator and 
some, but not all, non-gypsy SBSs (24,25). As su(Hwf 
females are fertile, we reasoned that genome-wide 
mapping of Su(Hw) f chromosome association would 
provide insights into the germline function of Su(Hw). 
To this end, we identified Su(Hw) f SBSs in the ovary. 
For Su(Hw) f , the oc-Su(Hw) IP sample generated over 
20.6 million uniquely mapped tags, with ~20 million 
obtained both from ChIP with pre-immune serum and 
input chromatin (Supplementary Table SI). Using a 1% 
FDR, we identified 1210 SBSs with a median peak size of 
249 bp (Figure 6A). The Su(Hw) f SBS dataset was 
validated using ChlP-qPCR analyses of a collection of 
randomly selected Su(Hw) f SBSs (referred to as 
f-retained) and SBSs that were lost (referred to as f-lost). 
These analyses showed that Su(Hw) bound all f-retained 
sites above the 1% input threshold, whereas only two 
f-lost sites were above this value (Figure 7A). These 
experiments demonstrate that a high-quality Su(Hw) f 
SBS dataset was obtained. 

Su(Hw) f SBSs largely overlapped with wild-type 
SBSs (95%; Figure 6B). To understand how loss of 
ZnFlO affects in vivo Su(Hw) occupancy, we defined the 
sequence motif associated with f-retained sites. Wild-type 
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Su(Hw) [Su(Hw) + ] binds a 20-bp motif containing two 
highly conserved 9 and 3 bp cores (48). Using the DNA 
motif search program MEME, we found that f-retained 
sites had a similar, but more restricted binding motif, with 
a higher information content [20.9 bits for Su(Hw) f 
relative to 15.0 for Su(Hw) + ; Figure 6C]. These findings 
imply that the binding specificity of Su(Hw) is not 
changed in the absence of ZnFlO. Instead, in vivo selection 
of binding sites is constrained. 

Comparison of Su(Hw) + occupancy at Su(Hw) f retained 
and lost sites showed that f-retained sites had a higher 
fold enrichment (Figure 6D). These data indicated that 
retention might be related to DNA-binding affinity. 



Previous studies had demonstrated that Su(Hw) + and 
Su(Hw) f had nearly identical in vitro DNA-binding 
affinities at the 1A-2 and gypsy insulators, even though 
in vivo binding at both was lost (24). We postulated that 
the presence of clustered binding motifs in these insulators 
might have masked differences in DNA-binding affinity. 
As nearly all endogenous sites contain single Su(Hw)- 
binding motifs, we reasoned that single motif regions 
might display differences in binding between the wild-type 
and mutant proteins. To this end, we randomly selected 
f-retained and f-lost sites and used EMSA assays to define 
apparent binding affinities for Su(Hw) + and Su(Hw) f . 
We found that Su(Hw) + demonstrated an ~22-fold 
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range in binding affinities between single motif SBSs 
(compare sites 4C1 and 43D3; Supplementary Figure S2; 
Table 1). In general, these studies showed that the 
apparent binding affinity of Su(Hw) + for f-retained sites 
was greater than f-lost, differing by an average of 
~ 1.4-fold. Tests of Su(Hw) f revealed that this mutant 
had reduced binding at all sites, showing ~1.6 to 
9.2-fold lower apparent binding affinities than observed 
for Su(Hw) + . In general, Su(Hw) f showed reduced 
binding at f-lost sites compared to f-retained, differing 
by an average of ~2. 3-fold. Even so, the apparent 
binding affinities for Su(Hw) f at f-retained and f-lost 
sites overlap, suggesting that binding affinity is not 
always predictive of Su(Hw) f retention. 

Tissue-specific effects on Su(Hw) f retention 

Based on the absence of a strict correlation between the 
in vitro and in vivo binding data, we postulated that chro- 
matin accessibility might impact Su(Hw) f retention. As a 
first step to address this prediction, we determined 
whether Su(Hw) f retention was correlated with gene struc- 
ture or one of the nine global chromatin states defined by 
the ModEncode project (35). These analyses failed to 
uncover any significant connection with these features 
(Supplementary Figure S3). Next, we reasoned that if 
decreased chromatin accessibility were responsible for dif- 
ferential Su(Hw) f binding, then Su(Hw) binding might 
show tissue-specific differences in site occupancy, as the 
chromatin landscapes are cell type specific. This possibility 
was tested using ChlP-qPCR of chromatin isolated from 
larval wing disc and brain tissues. We found largely 



unchanged Su(Hw) occupancy at f-retained sites in 
these cell types, while occupancy at f-lost sites showed 
tissue-specific differences (Figure 7B). In wing discs, we 
found that 14% (3/21) of the ovary f-lost sites were 
bound, while in brain, 9.5% (2/21) of the ovary f-lost 
sites were bound. These findings suggest minor differences 
in Su(Hw) f occupancy between tissues. 

As SBS subclasses show differences in enrichment at 
gene features, we investigated whether Su(Hw) f retention 
differed among these subclasses. Parsing f- retained sites 
into subclasses showed that SBS-CM sites were over- 
represented, while the proportion of SBS-C and SBS-O 
sites decreased (P = 2.29e-09; Figure 6E). This profile 
links Su(Hw) f retention with the presence of Mod67.2, a 
finding that is consistent our demonstration that Mod67.2 
enhances Su(Hw) + occupancy (Figure 4). 

Ovary expression of gene clusters does not depend 
on Su(Hw) defined boundaries 

The Drosophila genome is organized into clusters of 
co-regulated genes (49-52). We reasoned that if the 
germline function of Su(Hw) depended on an insulator 
activity, then Su(Hw) might be required for proper expres- 
sion of co-regulated ovary gene clusters. To test this 
prediction, we used the FlyAtlas database (53) to 
identify clusters of two or more genes that showed 
2-fold or greater up- or down-regulation in the ovary 
relative to whole fly. From this analysis, we found 772 
down- and 219 up-regulated gene clusters, which we 
refer to as ovary-repressed and ovary-expressed gene 
clusters, respectively. The presence of a larger number of 
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repressed gene clusters is consistent with observations that 
tissue-specific gene expression involves widespread tran- 
scriptional silencing (54). 

Potential Su(Hw) regulated gene clusters were identified 
using our ovary SBS datasets. We postulated that if the 
critical ovary function of Su(Hw) was to define transcrip- 
tional domains, then the ovary gene clusters should be 
delimited by an SBS that retained binding of the fertile 
Su(Hw) f mutant. Using these parameters, the number of 
clusters was reduced to 25 down- and 3 up-regulated gene 
clusters. Of the 34 f-retained sites bordering these domains, 
nearly half (47%) were gypsy insulator-like SBS-CM 
sites, with the remaining corresponding to SBS-O (44%) 
and SBS-C (9%). These observations imply an 



over-representation of the SBS-CM sites at the borders 
of gene clusters. 

To test whether Su(Hw) is required for transcriptional 
regulation of these ovary-regulated gene clusters, we 
measured RNA levels of genes in and neighboring the 
clusters, in a su(Hw) wild-type and mutant backgrounds 
(Figure 8). We predicted that if the essential germline 
function of Su(Hw) involved formation of a chromatin 
insulator, then loss of Su(Hw) would lead to 
cross-regulatory interactions between clustered and neigh- 
boring genes, resulting in an equalization of gene expres- 
sion across the SBS boundary. In our studies, gene 
expression was measured in RNAs isolated from newly 
eclosed wild-type and mutant ovaries. These ovaries 
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were chosen for two reasons. First, newly eclosed ovaries 
contain oocytes developed only to early stages of oogen- 
esis, so that wild-type ovaries do not contain developmen- 
tal stages lost in su(Hw) mutants. Second, newly eclosed 
su(Hw) mutant ovaries display defective development, 
suggesting that gene expression changes caused by loss 
of Su(Hw) should be manifest in these ovaries. QPCR 
analyses of su(Hw) + RNA confirmed the predicted 



patterns of gene expression, wherein genes in up-regulated 
clusters had higher RNA levels than down-regulated 
clusters, displaying 4- to 128-fold higher levels of expres- 
sion than neighboring genes (Figure 8). Measuring RNA 
levels in su( Hw) mutant backgrounds revealed that loss of 
Su(Hw) caused limited changes to expression of the clus- 
tered genes, with the exception of cluster 24D where all 
repressed genes demonstrated further decreased RNA 
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Table 1. Summary of in vitro Su(Hw) and Su(Hw) f DNA-binding analyses 



SBS 


Su(Hw) f Class 


Su(Hw) Binding Motif a 


Apparent K a (M _1 ) b 






Consensus: 


TGTTGCATACTTTTxGGCGC 


Su(Hw) WT 


Su(Hw) f 


Ratio 


4C1 






8.6 ± 3.4 x 10 5 


5.0 ± 0.4 x 10 5 


1 7 


4C15 


Retained 


TGTTGCATACTTGCGGGCGC 


6.0 ±2.6 x 10 6 


2.3 ±1.3 x 10 6 


2.6 


43D3 


Retained 


TACTGCATACTTTTGGGCGC 


1.9 ± 0.4 x 10 7 


3.0 ±0.3 x 10 6 


6.4 


46B3 


Retained 


TGTAGCATACTTTTGGGCGC 


6.6 ±2.6 x 10 6 


4.1 ± 1.6 x 10 6 


1.6 


58B1 


Retained 


TGTAGCATACTTTTCTGGGC 


j.U ± 2.2 x 11) 


1.9 ± U.o x 11) 


2.7 


65D3 


Retained 


CGTAGCATACTTTTAGGTGG 


4.1±.03xl0 6 


1.1 + 0.1 x 10 6 


3.6 








Avg 7.0 x 10 6 


Avg 2.2 x 10 6 


3.2 


2B9 


Lost 


TATTGCATACTTTTAGGTTC 


5.4 ± 0.4 x 10 6 


1.8 ± 1.0 x 10 6 


2.9 


20D1 


Lost 


CACTf3CATACTTTGCGGCAG 


3.5 ± 0.6 x 10 6 


3.9 ± 1.8 x 10 5 


8.9 


43E18 


Lost 


tgctgcatacttIcagHcg 


l.l±0.7xl0 7 


1.2 + 0.1 x 10 6 


9.1 


53C4 


Lost 


TATTJjCATACTTTTGTGCGC 


1.2 ±0.5 x 10 7 


1.3 ± 0.5 x 10 6 


9.2 


55D1 


Lost 


aatt^Bgc^Htaggcgc 


4.2 ± 1.1 x 10 6 


8.7 ± 1.2 x 10 5 


4.8 


85A1 


Lost 


ACTTGCATACTTTTTGGCGC 


3.7±1.7xl0 6 


9.2 ± 0.5 x 10 5 


4.1 


100C1 


Lost 


TGTTGCATACTTTGCACGTG 


2.0 ±0.1 x 10 6 


3.2 ±0.1 x 10 5 


6.1 








Avg 5.2 x 10 6 


Avg 9.5 x 10 5 


5.5 



a A match to the Su(Hw) consensus motif is noted using shading, where dark gray identifies a match to a nucleotide in 
the conserved 9 and 3 bp cores and light gray identifies a match of a residue outside of the conserved cores. 
X represents any nucleotide. b K a , apparent association constant, where M -1 is reverse molar. 



accumulation. Importantly, all clusters retained the dis- 
tinction between ovary-expressed and ovary-repressed 
genes in the su(Hw) null mutant background, suggesting 
that the ovary function of Su(Hw) may not involve for- 
mation of insulators to establish domains of gene 
expression. 

DISCUSSION 

Su(Hw) is a broadly expressed transcription factor that is 
required for oogenesis (21,25). Much of our understanding 
of Su(Hw) function has been obtained through investiga- 
tion of the gypsy insulator. These studies have led to the 
concept that Su(Hw) is an architectural protein involved 
in establishing higher order chromosomal structure critical 
for regulation of gene expression (45). However, emerging 
evidence suggests that the function of Su(Hw) extends 
beyond that of an insulator protein, including the recent 
demonstration that 1A-2, a cluster of two SBSs, is 
required for activation of yar, a non-coding RNA gene 
(55). These data suggest that Su(Hw) has multiple func- 
tions in the genome. 

The function of Su(Hw) in the ovary 

Previous studies estimate that between five to eighteen 
percent of SBSs are cell type specific (33), with evidence 
that 1-3% of SBSs are developmentally regulated (56). 
Here, we used ChlP-seq coupled with extensive ChlP- 
qPCR to show that Su(Hw) chromosome occupancy is 
largely constitutive throughout development (Figures 1 
and 2). While we identified a small set of 'ovary-specific' 
SBSs among the low fold enrichment SBSs, we showed 
that these sites are occupied in non-ovary tissues. Our 
data are consistent with the previous analysis of SBSs in 



the three megabase alcohol dehydrogenase region, in which 
Su(Hw) binding was conserved between different tissues 
(48). Our studies provide a cautionary note for investi- 
gations relying solely on computational evaluation of 
high-throughput genomic datasets, as we find that exten- 
sive validation is required to establish confident binding 
thresholds needed for data interpretation. 

The ovary-specific developmental requirement for 
Su(Hw) may be explained based on its function at the 
gypsy insulator. The insulator properties of Su(Hw) 
suggest that oogenesis may require establishment of 
domain boundaries that permit appropriate gene expres- 
sion in the ovary. To test this postulate, we defined 
genome-wide binding sites for Su(Hw) f , a mutant 
isoform that lacks insulator activity, but retains fertility. 
These studies revealed that Su(Hw) f was retained at only 
one third of wild- type sites (Figure 6). Ostensibly, these 
observations are surprising for an architectural protein, as 
two-thirds of SBSs can be lost without effects on essential 
functions needed for fertility (25). We extended these 
global analyses through direct studies of co-regulated 
gene clusters delimited by f-retained SBSs. We show that 
loss of Su(Hw) has limited, if any, effects on expression of 
these genes in the ovary (Figure 8). Based on these obser- 
vations, we suggest that the essential ovary function of 
Su(Hw) may not be related to establishment of boundaries 
of transcriptional domains, a conclusion supported 
by recent findings that null and nearly null alleles of 
mod(mdg4) and Cpl90 do not affect oogenesis. We 
suggest that Su(Hw) may act locally to change gene 
expression. Recent studies demonstrate that Su(Hw) is 
associated with repressed chromatin domains (35,57) and 
is enriched in lamin-associated domains (58). These obser- 
vations, together with findings that enhancer blocking 
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Figure 8. Loss of Su(Hw) has limited effects on transcription of clusters of co-regulated genes in the ovary. Gene expression studies were completed 
on four gene clusters (34A, 91C, 24D, 63E). Genes showing up-regulated in the ovary relative to other tissues are labeled in green, while genes that 
are down-regulated in the ovary are labeled in cyan. The orientation of the arrow denotes direction of transcription. The position of the SBS is 
shown by the inverted triangle, where SBS-O sites are shown as gray, SBS-C sites as blue and SBS-CM sites as red. Bar graphs of ACt values 
obtained from qPCR analyses of each gene relative to the housekeeping gene RpL32. RNAs analyzed were isolated from newly eclosed ovaries from 
su(Hw) WT (light gray), su(Hw/ (intermediate gray) and su(Hw) nul1 (black) females. Error bars indicate standard deviation of three independent 
biological replicates. 



activity of the gypsy insulator is disrupted by a lamin 
mutation (59), suggest that Su(Hw)-dependent regulation 
may involve gene silencing that requires Su(Hw) targeting 
to the nuclear periphery. 

Three classes of SBSs exist with distinct properties 

The availability of a high-quality dataset of SBSs provided 
the opportunity to investigate the genome-wide associ- 
ation of Su(Hw) with its partner proteins, Mod67.2 and 
CP 190. These analyses showed that SBS-O sites repre- 
sented the largest class (Figure 3). Further, we found 
that SBS-O and SBS-C sites displayed sequence conserva- 
tion that extended beyond the Su(Hw)-binding motif, 
which was not observed for the SBS-CM class. These 
data suggest that Mod67.2 confers greater flexibility 
to Su(Hw) association, a postulate supported by our dem- 
onstration that Mod67.2 facilitates Su(Hw) occupancy 
(Figure 4). These findings imply that the structurally 
related BTB-domain protein CP 190 cannot replace the 
function of Mod67.2 in facilitating Su(Hw) occupancy of 
SBSs. Although SBSs collectively display no enrichment 
with genie features, we find a skewed localization of 
SBS-CM sites to the 5'- and 3'-end of genes and coding 
exons (Figure 3D). Taken together, these data indicate 



that different classes of SBSs may have distinct regulatory 
contributions in the genome. 

Effects of loss of a single ZnF on genome-wide binding 

Su(Hw) has 12 ZnFs, with ten corresponding to C 2 H 2 
fingers and two corresponding to C 2 HC (7). Previous 
studies suggest that the major mode for Su(Hw) chromo- 
some association is DNA binding, as loss of ZnF7 causes 
complete loss of in vivo localization to chromosomes that 
correlates with defective in vitro binding (23,24). We 
demonstrated that loss of ZnF 10 eliminates Su(Hw) f 
occupancy at two-thirds of SBSs (Figure 6), with 
binding site selection of Su(Hw) f showing greater 
constraints than Su(Hw) + . While Su(Hw) f is lost at 
many genomic sites, this protein binds f-lost SBSs 
in vitro, although with reduced affinity relative to 
Su(Hw) + (Supplementary Figure S2). Yet, this reduced 
Su(Hw) f -binding affinity cannot account for all f-lost 
sites, as there is an absence of a strict correlation 
between in vitro DNA binding and in vivo chromosome 
Su(Hw) f occupancy. Further investigation revealed that 
some SBSs showed tissue-specific Su(Hw) f retention and 
that Su(Hw) f retention was optimal at SBSs that associate 
with Mod67.2, a protein partner associated with enhanced 
occupancy of Su(Hw) (Figure 6). Taken together, these 
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data suggest that Su(Hw) f retention is affected by multiple 
factors, including DNA sequence, tissue-specific effects 
that may depend on local chromatin structure and a 
protein partner of the gypsy insulator complex. 

Multi-ZnF domains are the most common DNA- 
binding motif among transcription factors in metazoan 
genomes (60). Our data are relevant to understanding 
how mutation of a single ZnF within a large ZnF- 
binding domain impacts chromatin occupancy of this 
class of transcription factors. We show that individual 
fingers may make distinct contributions to chromosome 
association, without altering the recognition sequence of 
the binding site. Interestingly, a second well-characterized 
vertebrate insulator protein CCCTC-binding factor 
(CTCF) is an eleven ZnF DNA-binding protein (61-65). 
Mutations in the gene encoding CTCF have been found in 
several human tumor samples, including breast, prostate 
and kidney (66). These tumor-associated alleles carried 
missense mutations that changed specific CTCF ZnFs, 
with none producing a truncated protein. Interestingly, 
in vitro studies demonstrated that these CTCF ZnF 
mutants had altered in vitro DNA-binding properties, 
reminiscent of Su(Hw) f . However, no in vivo binding 
studies were completed. Data obtained from analysis of 
Su(Hw) f predict that the cancer-associated CTCF muta- 
tions may alter the in vivo landscape of CTCF occupancy 
genome-wide. As a result, these effects may lead to 
complex changes in gene expression that may promote 
tumorigenesis. 
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